PLASER: Pronunciation Learning Via Automatic Speech Recognition
نویسندگان
چکیده
PLASER is a multimedia tool with instant feedback designed to teach English pronunciation for high-school students of Hong Kong whose mother tongue is Cantonese Chinese. The objective is to teach correct pronunciation and not to assess a student’s overall pronunciation quality. Major challenges related to speech recognition technology include: allowance for non-native accent, reliable and corrective feedbacks, and visualization of errors. PLASER employs hidden Markov models to represent position-dependent English phonemes. They are discriminatively trained using the standard American English TIMIT corpus together with a set of TIMIT utterances collected from “good” local English speakers. There are two kinds of speaking exercises: minimal-pair exercises and word exercises. In the word exercises, PLASER computes a confidence-based score for each phoneme of the given word, and paints each vowel or consonant segment in the word using a novel 3-color scheme to indicate their pronunciation accuracy. PLASER was used by 900 students of grade 7 and 8 over a period of 2–3 months. About 80% of the students said that they preferred using PLASER over traditional English classes to learn pronunciation. A pronunciation test was also conducted before and after they used PLASER. The result from 210 students shows that the students’ pronunciation skill was improved. (The statistics is significant at the 99% confidence level.) ∗Mr. Tam is now a graduate student at the Department of Computer Science at Carnegie Mellon University. †Mr. Chan is now working at SpeechWorks Inc.
منابع مشابه
Explicit Pronunciation Training Using Automatic Speech Recognition Technology
A system is described, provisionally named Pronto, which uses automatic speech recognition (ASR) for training pronunciation of second languages in adult learners. The first version of Pronto was developed for native speakers of American English learning Spanish and for Mandarin Chinese speakers learning English. Pronto grows out of work in the Indiana Speech Training Aid (ISTRA) research progra...
متن کاملAutomatic pronunciation error detection and guidance for foreign language learning
We propose an e ective application of speech recognition to foreign language pronunciation learning. The objective of our system is to detect pronunciation errors and provide diagnostic feedback through speech processing and recognition methods. Automatic pronunciation error detection is used for two kinds of mispronunciation, that is mistake and linguistical inheritance. The correlation betwee...
متن کاملTowards Automatic Mispronunciation Detection in Singing
A tool for automatic pronunciation evaluation of singing is desirable for those learning a second language. However, efforts to obtain pronunciation rules for such a tool have been hindered by a lack of data; while many spokenword datasets exist that can be used in developing the tool, there are relatively few sung-lyrics datasets for such a purpose. In this paper, we demonstrate a proof-of-pri...
متن کاملAutomatic assessment of children speech to support language learning
Focus of this work are pattern recognition related aspects of computer assisted pronunciation training (CAPT) for second language learning. An overview of commercial systems shows that pronunciation training is being addressed by the growing eld of computer assisted language learning only to a small extend, although in the state-of-the-art section a number of such approaches for automatic asses...
متن کاملAutomatic rule-based generation of word pronunciation networks
In this paper a method for generating word pronunciation networks for speech recognition is proposed. The networks incorporate different acceptable pronunciation variants for each word. These variants are determined by applying pronunciation rules to the standard pronunciation of the words. Instead of a manual search, an automatic learning procedure is used to compose a sensible set of rules. T...
متن کامل